Search CORE

124 research outputs found

Relative Value Iteration for Stochastic Differential Games

Author: A Arapostathis
A Arapostathis
DJ White
GK Basak
M Gruber
P Whittle
SP Meyn
VE Beneš
VS Borkar
VS Borkar
WH Fleming
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 02/04/2013
Field of study

We study zero-sum stochastic differential games with player dynamics governed by a nondegenerate controlled diffusion process. Under the assumption of uniform stability, we establish the existence of a solution to the Isaac's equation for the ergodic game and characterize the optimal stationary strategies. The data is not assumed to be bounded, nor do we assume geometric ergodicity. Thus our results extend previous work in the literature. We also study a relative value iteration scheme that takes the form of a parabolic Isaac's equation. Under the hypothesis of geometric ergodicity we show that the relative value iteration converges to the elliptic Isaac's equation as time goes to infinity. We use these results to establish convergence of the relative value iteration for risk-sensitive control problems under an asymptotic flatness assumption

arXiv.org e-Print Archive

Crossref

Comparison of Random Walk Based Techniques for Estimating Network Averages

Author: A Nazi
C Robert
E Nummelin
E Volz
J Abounadi
K Avrachenkov
MJ Salganik
P Billingsley
P Brémaud
S Goel
SM Ross
VS Borkar
VS Borkar
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 02/08/2016
Field of study

International audienceFunction estimation on Online Social Networks (OSN) is an important field of study in complex network analysis. An efficient way to do function estimation on large networks is to use random walks. We can then defer to the extensive theory of Markov chains to do error analysis of these estimators. In this work we compare two existing techniques, Metropolis-Hastings MCMC and Respondent-Driven Sampling, that use random walks to do function estimation and compare them with a new reinforcement learning based technique. We provide both theoretical and empirical analyses for the estimators we consider

Crossref

INRIA a CCSD electronic archive server

Geometrical Insights for Implicit Generative Modeling

Author: A Auffinger
A Gretton
A Müller
AA Zinger
B Schölkopf
B Sriperumbudur
BK Sriperumbudur
BK Sriperumbudur
C Villani
D Sejdinovic
GJ Székely
H Cramér
IJ Schoenberg
JM Hammersley
MA Aizerman
N Aronszajn
N Fournier
P Milgrom
R Mises von
RJ Serfling
RM Neal
ST Rachev
Steffen Dereich
T Hastie
VS Borkar
X Nguyen
Publication venue
Publication date: 21/08/2019
Field of study

Learning algorithms for implicit generative models can optimize a variety of criteria that measure how the data distribution differs from the implicit model distribution, including the Wasserstein distance, the Energy distance, and the Maximum Mean Discrepancy criterion. A careful look at the geometries induced by these distances on the space of probability measures reveals interesting differences. In particular, we can establish surprising approximate global convergence guarantees for the

1

-Wasserstein distance,even when the parametric generator has a nonconvex parametrization.Comment: this version fixes a typo in a definitio

arXiv.org e-Print Archive

Crossref

Compactness of the space of non-randomized policies in countable-state sequential decision processes

Author: AB Piunovskiy
AS Kechris
AS Nowak
E Altman
EA Feinberg
EA Feinberg
EB Dynkin
EJ Balder
Eugene A. Feinberg
H Nikaido
HL Royden
M Schäl
M Schäl
M Schäl
O Hernandez-Lerma
P Billingsley
R Strauch
RC Chen
RC Chen
Richard C. Chen
VS Borkar
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

N—Person Stochastic Games: Extensions of the Finite State Space Case and Correlation

Author: A Federgruen
A Jaskiewicz
AS Nowak
AS Nowak
AS Nowak
AS Nowak
AS Nowak
AS Nowak
AS Nowak
AS Nowak
AS Nowak
AS Nowak
C Castaing
C Harris
CJ Himmelberg
CJ Himmelberg
CJ Himmelberg
CJ Himmelberg
D Duffle
DP Bertsekas
E Altman
E Altman
E Altman
E Solan
F Forges
FM Spieksma
H-U Kiienle
H-U Küenle
IE Glicksberg
J-F Mertens
J-F Mertens
J-F Mertens
K Kuratowski
LD Brown
LI Sennott
LO Curtat
N Dunford
O Hernández-Lerma
O Passchier
P Billingsley
PK Dutta
R Amir
SP Meyn
SP Meyn
T Parthasarathy
T Parthasarathy
TB Bielecki
U Rieder
VS Borkar
W Whitt
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2003
Field of study

In this chapter, we present a framework for m-person stochastic games with an infinite state space. Our main purpose is to present a correlated equilibrium theorem proved by Nowak and Raghavan [42] for discounted stochastic games with a measurable state space, where the correlation o

CiteSeerX

Crossref

Controlled Markov Chains on a Countable State Space: Some Recent Results

Author: VS Borkar
VS Borkar
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/1990
Field of study

Crossref

A Remark on Control of Partially Observed Markov Chains

Author: Borkar VS
Publication venue: Springer
Publication date
Field of study

A new state variable is introduced for the problem of controlling a Markov chain under partial observations, which, under a suitably altered probability measure, has a simple evolution

Open Access Repository of IISc Research Publications